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ABSTRACT 

ASYMPTOTIC  NORMALITY  OF  U-STATISTICS  BASED 
ON  TRIMMED  SAMPLES 

Let  Xnl  <. .  .<  Xnn  be  an  ordered  sample  of  size  n.  We  establish  asymptotic 
normality  of  U-statistics  based  on  the  trimmed  sample  Xr  £an}  +  1  <• .  •<  XR  n_[gn]> 
where  0  <  a, 6  <  1/2.  This  theorem  and  its  multi-sample  generalization  are 
illustrated  by  various  statistics  of  importance  for  robust  estimation  of 
location,  dispersion,  etc. 

This  unifies  the  flexibility  of  the  class  of  U-statistics  and  the  classical 
principle  of  rejection  of  outliers.  In  addition,  as  a  tool  in  our  treatment, 
but  also  having  broader  Interest,  a  uniform  version  of  the  central  limit  theorem 
for  U-statlstics  is  provided. 
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1.  Introduction.  For  robust  estimation  of  location,  the  ordinary  sample 
mean  is  too  sensitive  to  outliers.  A  classical  and  successful  alternative  is 
the  trimmed  mean,  for  which  asymptotic  normality  was  established  by  Bickel 
(1965).  As  discussed  by  Bickel  and  Lehmann  (1975),  for  example,  the  trimmed 
mean  remains  relatively  efficient  with  respect  to  the  untrimmed  mean  even  in  the 
absence  of  outliers. 

Viewing  the  trimmed  mean  as  simply  the  ordinary  mean  defined  on  a  trimmed 

sample,  we  are  motivated  to  consider  other  common  statistics  as  well  in  this 

regard.  In  this  paper  we  study  U-statistics  in  such  fashion.  The  class  of  U- 

statistics,  Introduced  by  Hoeffding  (19*18),  contains  a  wealth  of  statistics  of 

interest  in  their  own  rights  and  also  contains  statistics  which  serve  as 

approximations  to  statistics  of  more  complicated  type.  A  very  significant 

broadening  of  the  scope  of  robust  statistical  inference  is  achieved,  therefore, 

by  consideration  of  the  class  of  U-statistics  on  trimmed  samples.  _  . 

Specifically,  the  statistics  we  treat  are  defined  as  follows.  Let 

X . ,X  be  an  i.i.d.  sample  from  a  df  F,  let  X  ,  <...<  X  denote  the  ordered 

In  nt  —  —  nn 

X.'s,  let  0  <  a , B  <  1/2,  and  put  n  -  n  -[an]-[8n].  Let  h(x,,...,x  )  be  a 
1  cits  1  m 

"kernel"  assumed  (without  loss  of  generality)  to  be  symmetric  in  its  arguments. 
For  each  such  kernel  we  consider  the  associated  U-statistlc  defined  on  the 
(a,B)-trimmed  sample,  i.e., 


(1.1) 


n  -1 

U  -  (  aB) 
naB  m 


l  h(X  . X  ). 

r  ni  ni 

C  1  m 

naB 


where  C  „  denotes  the  set  of  m-tuples  ((I, . 1  ) :  [an  ]  ♦  1  <  i  <  . .  .<  1  <  n-[Bn]). 

naB  1  m  -  1  m  — 

(The  cases  a  >  0,  B  =  0  and  a  »  0,  B  >  0  could  also  be  considered  but  will  be 
omitted  for  simplicity.  The  case  a  *  8  -  0  corresponds  to  ordinary  U-statistics 
based  on  full  samples,  for  which  there  is  already  an  extensive  literature  (see, 
e.g.,  Serfllng  (1980),  Chapter  5)). 


AMS  Subject  Classifications  :  Primary  62EPO,  Secondary  62G35 


Key  words  and  phrases  :  U-statlstics,  trimmed  samples,  robust  inference, 
nonparametr lc ,  uniform  central  limit  theory . 
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(1.2) 


For  m  -  1  and  the  kernel  h(x)  -  x,  (1.1)  give3  the  trimmed  mean 
_1  n-[Bn] 


Xna8  ’  naB  i_tJn]+1  Xni  ’ 


treated  by  Bickel  (1965).  For  m  -  2  and  the  kernel  htXj.x,,)  =*  Kx^x^O}  we 

obtain  a  version  of  the  Wilcoxon  one-sample  statistic  treated  by  Saleh  (1976). 

Evidently,  these  and  only  one  or  two  other  cases  of  (1.1)  have  been  previously 

studied  in  the  literature,  despite  the  abundance  of  natural  possibilities.  For 

2 

example,  with  m  -  2  and  hfx^x^  -  (x^x^  /2,  we  obtain  the  statistic 

n-[Bn] 


(1.3) 


s';  -  (n  -i)"1  I  (X  -  X  )2  , 

naB  aB  i-[an]+1  ni  naB 


a  quite  natural  robust  analogue  of  the  classical  sample  variance.  Surprisingly, 
this  has  not  been  examined  previously,  although  a  Winsorized  version  has  been 
treated  by  Jaeckel  (1971)  (see  also  Bickel  and  Doksum  (1977),  p.  375). 

Other  methods  of  using  trimming  to  produce  robustness  have  appeared  in  the 

literature.  For  example,  see  the  "doubly  trimmed  standard  deviation"  of  Bickel 

and  Lehmann  (1976),  and  a  "trimmed  standard  deviation"  introduced  by  Bickel  and 

Lehmann  (1979)  and  treated  theoretically  by  Janssen,  Serf ling  and  Veraverbeke 

(1984).  Also,  consider  the  "trimmed  U-3tatistics" ,  produced  by  trimming  on  the 

basis  of  ordered  values  of  h(X^  ,...,Xi  ),  which  are  a  special  case  of  the 

*1  m 

generalized  L-statistics  treated  by  Serfling  (1984).  The  present  development, 
In  which  the  trimming  is  applied  directly  to  the  sample  values  X^,  is  perhaps 
the  most  natural  and  reasonable  way  to  implement  a  principle  of  rejection  of 
outliers. 

In  Section  2  we  establish  asymptotic  normality  for  statistics  of  form  (1.1) 
(under  suitable  regularity  conditions),  thereby  extending  and  unifying  Bickel' s 
result  for  the  trimmed  mean  and  Hoeffding' s  result  for  the  case  of  untrimmed 
samples.  We  also  cover  the  multi-sample  case.  The  main  results  are  given  by 
Theorems  2.1  and  2.2. 


Our  method  of  proof  utilizes  recent  work  of  Randles  (1982)  on  U-statistlcs 
based  on  kernels  having  unknown  parameters.  However,  an  alternate  approach 
consists  of  extension  of  Bickel’ s  method  for  the  trimmed  mean.  This  is  discussed 
in  Remark  B  of  Section  2.  The  approach  entails  proving  a  result  which  is  of 
general  interest,  a  uniform  central  limit  theorem  for  U-statistics ,  extending 
Parzen  (1954).  We  present  this  result  in  the  Appendix. 

The  remainder  of  the  paper  (Section  3)  treats  important  examples  :  moment- 
type  statistics,  Wilcoxon-type  statistics  and  Gini-like  measures  of  location  and 
spread.  A  number  of  interesting  new  statistics  are  examined. 


2.  General  theorems.  Our  asymptotic  normality  result  for  the  statistic 

2 

U  „  defined  by  (1.1)  will  involve  mean  and  variance  parameters  u  and  o  „ 
naB  aB  aB 

defined  as  follows.  Corresponding  to  the  given  kernel  h,  we  define  for  x,u,v  6]R 


g(XjU,v) 

u(u,v)  - 
A(u,v)  = 


V  V 


m-1 


(1-B-a)  m  I{u<x<v)  h(x,x  ,...,x  )  IT  dF(x  ), 

—  1  m-l  ,  ,  1 


u  u 
v  v 


i-1 


u  u 


_  m 

Eg(X;u,v)  **  ( 1 -B"a)  J...J  h(x^,...,x^)  n  dF(x^), 


Var  g(X;u,v) , 


and 


A(u,v)  =  -g(u;u,v), 

B(u,v)  -  g(v;u,v) . 


Then  we  define 


and 


p  -  p(F  1 (a) ,  F  1 ( 1  - B ) ) 

ap 


a  2  ”  m2{A  +  2ap  A  -  2Bu  „  B  ♦  o(1~a)  A2 

aB  aB  aB  aB  aB  aB  aB 


aB 


A(F  1 (a) , 


♦  8(1-B)  B2  ♦  2aB  A  B  J, 
aB  aB  aB 

F_1(1-B)),  etc. 


where  A 


ASSUMPTIONS. 


(A)  F  has  a  density  f  which  is  continuous  and  positive  at  F  ^a)  and  F  1  ( 1  - 8 ) 
and  is  bounded  in  some  {-neighborhoods  of  F  ^(a)  and  F  '(1~B); 

(B)  For  some  a  <  F  \a)  and  b  >  F  1  ( 1  — 8 )  : 


a< x 1 , . . . , xm  <  b 


I  h(x. . x  )  I  -  M  <  •} 

'i  cn  1  o 


F  1 (1-8)  F  1 (1-8)  m-1 

(C)  The  function  g(x)  -  J  ...  j  ,  h(x,x, . x  ,)  n  dF(x, )  ij 

F_1(a)  F_1(a)  1  m_1  i-1  1 

continuous  at  F  1 (a)  and  F 


THEOREM  2.1.  Let  U  „  be  given  by  (1.1), 

-  nag  - 

(C)  and  that  a2  „  >  0.  Then 
-  aB  - 


Assume  conditions  (A),  (B)  and 


I/O  O 

n  (u  a-V  . )  -  N(0,  <j  } . 
naB  aB  aB 


The  proof  will  utilize  a  series  of  lemmas  involving  certain  U-statistlcs 

closely  related  to  U  .  For  the  given  kernel  h,  and  for  each  u  <  v  e  IH,  we 

naB 

define  an  associated  kernel 


h(x . x  ;u,v)  -  (1-B-a)  I{u  <  x . .  <  v)  h(x,,...,x  ) 

1m  —  1  m  —  1m 


and  we  denote  by  U^u.v)  the  ordinary  (i.e.,  defined  on  the  full  sample)  U-sta- 
tistic  based  on  this  kernel. 

Then  we  have 


n  .  -1 


U  -  (  aB)  (")  (l-B-a)"1  U  (X  ,  ,  .  X  r  ,) 
naB  mm  n  n,[anj+l  n,n-[Bn] 


and  we  readily  obtain 
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LEMMA  2.1.  Under  condition  (B), 


Una8  ”  Un(Xn.[an]*1  ’  Xn,n-[8n])  +  °p(n  1}* 


Next  we  shew  that  the  leading  term  in  (2.2)  may  be  approximated  by 


Tnag  "  Un(F  (a)’  F  (1_B))  yaB  +  u(Xn,[an]+1  *  Vn-Un]5, 


LEMMA  2.2.  Under  conditions  (A)  and  (B), 


Tna»  ’  VXn,[an).,  •  X„, *  V"'"'2’ 


PROOF.  We  make  direct  application  of  Theorem  2.8  of  Randles  (1982),  by 
which  (2.3)  holds  if  Randles'  Conditions  2.2  and  2.3  are  fulfilled.  First  we 
note  that,  by  classical  central  limit  theory  for  order  statistics  (or  by  (2.7) 
and  (2.8)  below). 


This  is  Randles'  Condition  2.2  specialized  to  our  setting.  Next  we  note  that  by 

condition  (B)  there  exists  M  <  »  such  that  for  all  x,,...,x  and  all  (u,v)  in 

1  1m 

some  neighborhood  of  (F  ^a),  F  ^{l-e)). 


(2.5)  |  h(x  . x  ; u ,  v )  -  h(x  . x  ;F  1 (a ) ,  F  1 ( 1 ) )  |<  M,  . 

1  m  l  m  '  —  1 

Now  let  K  be  a  neighborhood  of  (F  ^a),  F  1  ( 1  ~B ) )  which  is  contained  in  the 
rectangular  neighborhood  of  (F  '(a),  F  1 ( 1 - B ) )  in  which  (A)  and  (B)  hold.  For 
(u,v)  G  K  and  for  a  sphere  D  centered  at  (u,v)  with  radius  d,  such  that  DC  K, 
we  have 


sup  I  h(x  , . . . ,x  ;u’ ,v  )  -  h(x  , . . . ,x  ; u , v )  I 
(u’  ,v’  )  e  d  1  m  1  m 

-m  m 

<  ( 1 -B-a)  |h(Xl . xm)  j[  Z  I{u-cKx  <u+d}  n  I (u-d<x .<v+d) 

i-1  J*1  J 

m 

+  Z  Ifv-d<x  <v+d}  n  I(u-d<x.<v+d}l. 
i-1  -  "  J- 


Then,  using  (A)  and  (B),  we  obtain 


El  sup  |  h(X  . X  ;u' ,v‘ )  -  h(X. . X  ;u,v)  I  } 

(u*  ,v’  )  6  D  1  m  1  m  ' 

u+d  v+d  v+d  v+d  v+d  v+d  m 

<  m(l-B-a)"  [(  j  J  ...  j  +  J  j  —  /  )  |  h  |  n  dF(x  )] 

u-d  u-d  u-d  v-d  u-d  u-d  i-1  1 

<  2m  Mo(1-8-a)  m[F( v+d)-F(u-d)]m-1 {[F(u+d)-F(u-d)]+[F(v+d)-F(v-d)]} 


<  M2d, 


for  suitable  choice  of  constant  M2  not  depending  on  choice  of  D.  By  Randles’ 
Lemma  2.6,  his  Condition  2.3  follows  from  our  (2.5)  and  (2.6).  Thus  our  lemma 
follows.  □ 

LEMMA  2.3.  Under  condition  (B), 


,-1,  .  "I 


VF  '(a).  F  '  ( 1 -B) )  -  Uag  -  l  Cg(Xi;F'1(a),F’1(1-B))-paB) 


4  °p(n"1/2)- 


PROOF.  This  is  immediate  from  the  projection  theory  of  ordinary  U-statls- 
tics  (see,  e.g.,  Serfling  (1980),  Chapter  5).  o 


7*. 


S,,1 
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LEHMA  2.4.  Under  condition  (A)  and  (C), 


p(Xn,[anM  ’  Xn,n-[6n])  yo 


mA  ta-F  (F_1(a))]  +  mB  0  C 1~6-F„(F  1 ( 1-B) )  ]  ♦  o  (n'1/2)f 
aB  n  aB  n  p 


where  denotes  the  usual  sample  df  of  X^ . X^. 


PROOF.  By  condition  (A)  and  a  result  of  Ghosh  (1971)  on  Bahadur  represen¬ 
tation  of  order  statistics,  we  have 

-1  “  ‘  F  (F_1(a))  _1/? 

(2.7)  X  ,  ,  -  F  '(c)  -  - ^ -  ♦  o  (n  d) 

n,tan]+1  f(F  ’(a))  p 


1-B-F  (F  '(l-B)) 


X  r„  t  ~  F  (1-B) 
n,n-[6n] 


1  ’  V  l  \  ■  P /  /  -I/O 

’(1-B) - fi-r, -  -  o  (n  1/2). 


f(F  1  (1-B))  P 


By  conditions  (A)  and  (B)  again  along  with  condition  (C), 


-1  -1  -  m  f(F'\a))A 

(u , v )  -  (F  (a) ,F  (1-B))  * 


( u , v )  -  (F  1 (a) , F  ’(1-B)) 


-  m  f (F  ( 1  —  B )  ) B 


Thus,  by  the  multivariate  version  of  Young's  form  of  Taylor's  theorem  (e.g.,  an 
immediate  extension  of  Theorem  C  on  page  45  of  Serfling  (1980)), 


u(u.v)  -  ii  -  m  f(F_1U))  A  a  (u-F_1  (a) ) 
aB  aB 

♦  m  f(F_1 (1-B))  B  (v-F'1 (1-B)) 

ap 

♦  0(J|  (u,v)  -  (F"\a),  F_1(1-B))  II  ). 


Applying  (2.4),  (2.7)  and  (2.8)  in  (2.9),  we  obtain  the  desired  result,  o 


T 


■? 
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AV 


PROOF  OF  THEOREM  2.1.  Define 


4i(x;u,v)  -  m  {[g(x;u,v)  -  y(u,v)]  +  A(u,v)[F(u)  -  I{x£u)] 

♦  B(u,v)[F(v)  -  I{x<v}]}, 


Then,  combining  Lemmas  2.1  -  2.4,  we  may  write 


-i  ^  -1  -i /? 

U  .  -y  -  n  I  4<(X  j F  (a)  ,F  (1-6))  ♦  o  (n  ). 

i-1  P 


naB  aB 


-1  -1  2 
Finally,  note  that  ilKX^F  (a),  F  ( 1  -B ) )  has  mean  0  and  variance  oag-  o 


REMARK  A.  By  Lemmas  2.1  and  2.2,  we  may  write 


(2.10) 


naB 


*  *  -1/2 
y  -  U  +0  ♦  o  (n  ), 

aB  n  n  p 


where  U  is  an  ordinary  U-statistic  with  mean  0  and  asymptotic  variance 
n 

2  * 

parameter  m  A  ,  and  0  is  a  function  of  (two)  order  statistics.  It  can  happen 
aB  n 

2  * 

that  o  „  >  0  but  one  of  these  two  components  is  negligible,  namely  U  if  A  -  0 
a  p  M  • *  o.  p 

*  * 

and  0  if  A  -  B  -  0.  The  case  that  U  is  non-negligible  (i.e.,  A  „  >  0)  can 
occur  even  when  the  ordinary  U-statlstic  based  on  the  original  kernel  h  is 
degenerate.  That  is,  a  U-statlstic  which  has  nonnormal  limit  distribution  when 
defined  on  the  full  sample  can  have  a  normal  limit  distribution  when  defined  on 
a  trimmed  sample.  □ 

It  is  straightforward  to  extend  Theorem  2.1  to  the  case  or  multi-sample 

U-statlstics .  For  simplicity,  we  consider  the  2-sample  situation.  Let 

X  ,...,X  be  an  i.i.d.  sample  from  df  F  and  Y . Y  i.i.d.  from  F  .  Let 

1  n^  l  1  n^  c- 

h(x  . x  ;  y  ,...,y  )  be  symmetric  within  blocks  and  consider 

1  m^  1  m^ 


(2.11) 


naB 


2  n  -[a.n ,]-[&  n  ]  -1 

n  (  1  11  1  1  )  . 

i-1  ml 


where 


CnaB  Vi  Vm,  Vi  Vm2 
:na|  "  {(11 . 1m1iJ1,---*jm_):[aini]  +  1i1^-"<im^nrCBini] 


and  [a2n2]H<j1<...<jm^<n2-[B2n2]). 

For  u<v  e  ir  and  u’ <v’  G  ]R,  define  associated  kernels 

h(x . x  ;y  . y  ;  u,v;u’  ,v’  ) 

1  m  l  m2 

“D,l 

-  n  (1-B  -o  )  h(x  , ,x  ;y  . y  ). 

i-1. 2  11  1  mi  1  m2 

I{u<x  . x  <  v,u»  <  y  . .  <  v’  ) . 

-  1  mi  ~  —  1  m2  ~ 

Denote  by  (x;u,v,u’ ,v’ )  the  conditional  expectation  of  this  kernel  given  X^x, 
and  by  g?(y ;u,v,u’  ,v'  )  the  conditional  expectation  given  Y^y.  Put 

m(u,v,u’,v')  -  Egj (X1 ;u,v,u’  ,v’ )  -  Eg2(Y  ;u,v,u’ v' ) , 

A1 (u.v,u'  ,v' )  -  Var  g1 (X  ;u ,  v  ,u’ , v' ) , 

and 

A^u.v.u' ,v' )  -  Var  82(X1 ;u,v,u’ ,v' ). 

Let  v,  A  .  A2  denote  the  evaluations  of  these  quantities  at 
(Fj’to,).  F^’d-e,).  F2’(a2).  F21(1-B2)). 

Further  let  A  ,  B1  ,  A2,  §2  denote  the  partial  derivatives  of  p(u,v,u’,v’  )  w.r.t. 
u.v.u'  and  v’  respectively,  evaluated  at  ( 1  ( ot  ),  F~ 1  ( 1  -g ,  ) .  fVo,).  F~’  <  1  -B_) ) 


Set  kl  =  Ai/mif1(F1  (aj)  and  B1  =  Bi/mi*VFi  ( 1  - B 1 ) )  for  1-1,2,  assume  that 


- -  -*  X  (i-1,2)  as  min(n.  ,n  )  *  », 

n,+n~  1  12 


1  "2 


and  define 


o'  «  Z 


Wi2  x;  U1  +  2aiuAi  "  2eiyBi  +  «i(1-cx1  )A^ 


♦  Bt ( 1 -B1 )Bi  *  2aieiAiBi]. 


Then  we  have  : 


THEOREM 


2.2.  Assume  that  F  (1-1,2)  and  h  satisfy  (analogues  of)  conditions 


(A),  (B),  (C)  and  assume  o  >0.  Then,  as  mintn^n^)  *  »  such  that 


ni / ( n 1 +n21  +  X^ ( 1=1  ,2) , 


(ni +n2)  (UnaB  '  P)  *  N(0’°2)' 


The  proof  Is  similar  to  that  of  Theorem  2.1.  Also,  from  the  above  formula 
for  the  asymptotic  variance,  it  Is  easy  to  recognize  what  the  asymptotic 
variance  parameter  should  be  in  the  c-sample  case  for  c  >  2, 


REMARK  B.  Bickel  (1965),  In  treating  the  trimmed  mean,  used  a  different 

method  of  proof,  which  was  also  adapted  and  followed  by  Saleh  (1976)  with  some 

gaps  in  the  development.  Such  an  approach  also  can  be  followed  for  our  Theorem 

2.1,  as  follows.  Conditional  on  X  .  ,  and  X  ,  ,,  the  statistic  U  is 

n,[on]  n,n-[BnJ+1  naR 

distributed  a3  an  ordinary  U-statistic  based  on  an  1.1. d.  sample  of  size  n 

cxB 

from  a  certain  df  depending  on  the  given  two  order  statistics.  By  a  uniform  CLT 
for  U-statlstics  (see  Appendix),  we  can  apply  Theorem  2  of  Sethuraman  (1961)  and 
complete  the  proof  in  the  manner  of  Bickcl  (1965).  o 


3.  EXAMPLES. 


3.1 .  Central  momenta 

Here  we  consider  robust  (l.e.,  trimmed  sample)  versions  of  the  classical 
measures  of  location,  dispersion,  skewness,  kurtosis,  etc.  Since  all  central 
moments  may  be  represented  as  U-statistics  (Hoeffding  (1948),  p.  295),  Theorem 
2.1  yields  the  appropriate  results.  In  particular,  let  us  symmetrically  trim 
the  sample  (a-B)  and  consider  the  trimmed  mean  (1.2)  and  the  trimmed  variance 

(1.3) .  Let  us  also  confine  attention  to  df' s  which  are  symmetric  about  0.  Then 
Theorem  1 .2  yields 

(3.1 )  n1/2  X  *  N(0,o?(a,1-o)), 

naa  > 

with 

2  -2  fF  (1~a)  2  -12 

(3-2)  of(o.l-o)  -  ( 1 -2a)  [j  ,  x  f(x)dx  +  2a(F  (a))  ], 

F  (a) 

which  corresponds  to  Bickel  (1965),  and 

(3.3)  °1/2  S^aa  *  N(y(a,1-a),  o2(a,1-a)), 
with 

-1  fF‘1(1-«>  2 

(3.4)  w(a,1-a)  -  (1-2a)  j  .  x  f(x)dx 

F  (a) 

3nd  -1  -1 

-  ?  F  ( 1 -o)  „  F  (1-a)  .  2 

(3.5)  o,(a,1-a)  -  (1-2a)  [  J  x  f(x)dx  -  (J  .  x  f(x)dx) 

2  F  '(a)  F  '(a) 

-1  2rF  ^-a)  2  -14 

-  4a(F  (a))  j  .  x  f(x)dx  ♦  2a(1~a)(F  (a))  ]. 

F  (o) 

This  explicit  result  for  the  asymptotic  variance  makes  it  possible  to  compare 
2 

S  with  competitors  such  as  the  usual  sample  variance  and  the  mean  absolute 
naa 

deviation,  using  an  appropriate  asymptotic  relative  efficiency  criterion  based 
on  asymptotic  variance  parameters. 


3.2.  Wllcoxon-llke  statistics 


In  the  one-sample  case,  assume  F  has  density  f  symmetric  about  A  and 

consider  testing  A  -  0  versus  A  >  0.  This  can  be  formulated  as  a  problem  of 

testing  P{Xj  +  X2  >  0}  -  1/2  versus  P { X ^  +  X.,  >  0}  >  1/2,  with  corresponding 

test  statistic 


U 

n 


(")  1  I  I{X  +  X  >  0}, 
1<l<j<n  1  J 


which  is  asymptotically  equivalent  to  the  one-sample  Wilcoxon  statistic. 
Trimming  leads  to  consideration  of 

I  I(X  ,  •  X  ,  >  01. 


u  -  ( 

naa  2 


[an]+1 <i<J<n-[an] 


nl 


nj 


From  Theorem  2.1  it  follows  by  routine  calculations  that,  under  the  null 
hypothesis  A  -  0, 


(3.6) 


n1/2  (U  -  1/2)  +  N(0,  — 1  — -=) . 
naa  „  ,2 


3( 1 -2a) 

a  result  previously  found  by  Saleh  (1976). 

In  the  two-sample  case,  let  P  -  {(F,G)  :  F(x)  <  G(x),  all  x)  and  consider 
testing  F  -  G  versus  F  *  G,  or  in  turn  consider  testing  PfX^  <  Y^}  -  1/2  versus 
P{X1  <  Y^}  <  1/2.  The  usual  Wilcoxon-Mann-Whi tney  statistic  has  the  following 


formulation  In  the  trimmed-sample  case  : 

1 


n  -[on^]  n^-Can^] 


U 


nin2a  (n1~2[an1  ])  (n2"2[an2])  i-Can^  +  1  J=[an2]  +  1  “l*  "2' 

which  can  be  studied  by  our  Theorem  2.2,  which  yields 

d 


I (X  ,<Y 
n  i  n0J 


(3.7) 


1/P  1  ”  1  1+4 

n  (U  „  -  b  +  N(0 ,  77—-,  — — - 

nlV  2  X(,'X)  12(l-2a)2 


S - ) 


under  the  null  hypothesis,  where  n  -  n^n  ar>d  *  ”  n^n* 
This  result  can  also  be  found  In  Hettmansperger  (1968). 


) 


3.3.  Glnl-llke  measures  of  location  and  dispersion 


For  estimating  location,  Heilmann  (1980)  proposed  to  modify  the  sample 
mean,  which  can  be  expressed  as 


to 


X  -  (")  1  E  median  (X  ,X  ) 
Ki<j<n  1  J 

Ln  -  (")-1  E  median  (X, .X..X, ). 
n  3  1<Kj<k<n  1  J  k 


In  similar  vein,  we  consider  as  competitors  of  the  trimmed  sample  mean  the 

statistics  U  of  the  form  (1.1)  corresponding  to  the  kernels  given  by 
ncxp 

(3-8)  h(x . ,x  )  -  median  (x,,...,x  ) 

i  m  1  m 

for  m  -  3, 4,...  Asymptotic  normality  is  provided  by  Theorem  2.1.  Let  us  note 

that  naive  computation  of  the  statistic  U  corresponding  to  (3-8)  would 

nc»0 

require  0(nm)  steps.  It  is  of  interest,  therefore,  that  for  the  kernel  (3.8) 
may  also  be  represented  as  an  L-statistic, 


(3.9) 


n-[f5n] 

U  „  -  E  c  ,  X  .  , 
naB  1-CanM  ni  ni 


where,  with  (^)  *  0  if  a  <  b, 


(3-10) 


'ni 


i  n  „  _1 

1  ,  aBx 

■  9  (_  ) 

2  m 


[( 


i-[an]-1  n-[Bn]-i 


i-[an]-1  n-[Bn]-l 


rni-1 1  )  ( 
L  2  J 


[I] 


)  +  ( 


tl] 


)  ( 


[Eli] 

L  2  J 


)]. 


Thus,  given  the  c  ’  s  (which  itself  is  a  computational  problem,  of  course),  one 
can  compute  the  statistic  via  the  formula  (3.9)  in  0(n  log  n)  steps. 

For  estimating  dispersion,  Heilmann  (1980)  proposed  to  modify  Gini’ s  mean 
difference,  which  is  the  U-stati3tlc  based  on  kernel  htx^.x^)  ”  I  X1  “  x2  I  *  but 
which  also  can  be  expressed  as 


-14- 


(3.1D 


G  -  (")  E  range  (X  ,X  ) 

1 <i<j<n  J 


(3.12) 


G*  -  (")"’  E  range  (X  ,  X  ,  X  ). 
3  1 <i<j<k<n  3 


In  similar  vein,  we  consider  as  competitors  to  G^  and  Gn  the  class  of  statistics 
UnaB  0f  form  corresponding  to  the  kernels  given  by 


(3-13)  h(x . ,x  )  -  range  (x,,...,x  ) 

l  m  1m 


for  m  -  3.1*.....  Asymptotic  normality  is  provided  by  Theorem  2.1. 

It  is  well-known  that  the  statistic  G  may  be  written  as  an  L-3tatistic 

n 

(see,  e.g.  Serfling  (1980))  and  this  is  shown  by  Heilmann  0980)  to  be  true  also 

» 

for  Gn>  It  is  easily  seen  that  in  fact,  this  is  true  in  general  for  the 

statistics  U  based  on  the  kernel  (3.13),  including  the  case  a  -  &  -  0 
na£J 

(untrimmed  sample).  The  relevant  cni’ 3  for  3  representation  of  form  (3-9)  are 
found  to  be 


n  .  -1 


(3.1*)  cn|-(.aB)  [(‘-tan]-')  .  ("-UM-1)3. 

ni  m  m-i  m-1 


An  alternative  way  of  generalizing  the  classical  Gini’ s  mean  difference  is 
by  considering  the  class  of  U-statlstcs  based  on  the  kernels  of  ferm 


(3-15) 


h(x1  ,x2)  -  |  x1-x?  |  p  , 


for  p  >  0.  This  gives  the  "p-th  power  measures"  considered  by  Bickel  and 
Lehmann  (1979).  However,  results  of  Boos  (1979)  indicate  that  the  case  p  -  1  is 
highly  competitive  to  the  cases  p  *  1 .  It  is  of  interest  to  examine  the  class 
of  "p-th  power  measures"  on  trimmed  samples,  which  is  now  possible  via  Theorem 


2.1  . 


-15- 


Appendlx  :  a  uniform  CLT  for  U-statistlcs 

Here  we  provide  a  useful  extension  to  U-statistics  of  the  uniform  CLT  for 

sample  means  given  by  Parzen  (195*0.  Let  be  independent  IR-valued 

random  variables  having  common  df  F(x;0),  where  0  G  G  C  1R*3,  p  ^  1,  and  let  Ur 

denote  the  usual  U-statistic  based  on  a  given  symmetric  kernel  h(x. . x  ). 

l  m 

Define 

y(0)  -  E0h, 

g ( x , 0 )  -  E0(h(X1 . Xffl)  |  X1  -  x}, 

and 

o2(0)  -  Var  giX,0) . 

0 

THEOREM.  Assume 


(1 )  Eh2  <  K  <  »,  all  0  6  0, 

U 

and 

(2)  o2(0)  >  L  >  0,  all  0g  9, 
for  fixed  constants  K  and  L,  and 

(3)  lim  E  {[g(X,0)  -  y(0)]2  It|  X  |  >  M}}  -  0,  uniformly  in  96  0. 

M-»  0 

Then,  uniformly  in  0  e  0. 

sup  |  Pln1/<2  (U  -y(0))/mo(0)  <  x)  -  $(x)  |  »  o(1). 
x  n 

PROOF.  Using  the  projection  lemma  as  in  Hoeffding  (19*18),  we  decompose 

1  /2 

n  (U  -y(0))/mo(0)  into  a  leading  term  which  is  a  properly  normalized  sum  of 
n 

i.i.d.  r.v.'s  and  a  remainder  term.  By  a  uniform  version  of  Slutzky' s  Theorem 
(Parzen  (1954),  Theorem  18D),  it  suffices  to  show  that,  uniformly  in  0  e  0,  the 
leading  term  has  a  normal  limit  and  the  remainder  term  tends  to  0  in 
probability . 


The  former  is  obtained  from  the  uniform  normal  convergence  theorem  for  sample 
means  (Parzen  (1954),  p.  38),  while  the  latter  follows  by  an  application  of 
Chebyshev' s  inequality,  a 
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